Text Extraction From Documents


Text extraction from documents is the process of extracting text data from scanned documents or images.

Generative Compositor for Few-Shot Visual Information Extraction

Add code
Mar 21, 2025
Viaarxiv icon

MarkushGrapher: Joint Visual and Textual Recognition of Markush Structures

Add code
Mar 20, 2025
Viaarxiv icon

DocVideoQA: Towards Comprehensive Understanding of Document-Centric Videos through Question Answering

Add code
Mar 20, 2025
Viaarxiv icon

Real-world validation of a multimodal LLM-powered pipeline for High-Accuracy Clinical Trial Patient Matching leveraging EHR data

Add code
Mar 19, 2025
Viaarxiv icon

Narrative Trails: A Method for Coherent Storyline Extraction via Maximum Capacity Path Optimization

Add code
Mar 19, 2025
Viaarxiv icon

Predicting Cardiopulmonary Exercise Testing Outcomes in Congenital Heart Disease Through Multi-modal Data Integration and Geometric Learning

Add code
Mar 18, 2025
Viaarxiv icon

Applications of Large Language Model Reasoning in Feature Generation

Add code
Mar 15, 2025
Viaarxiv icon

ARLED: Leveraging LED-based ARMAN Model for Abstractive Summarization of Persian Long Documents

Add code
Mar 13, 2025
Viaarxiv icon

A Hybrid Architecture with Efficient Fine Tuning for Abstractive Patent Document Summarization

Add code
Mar 13, 2025
Viaarxiv icon

KAP: MLLM-assisted OCR Text Enhancement for Hybrid Retrieval in Chinese Non-Narrative Documents

Add code
Mar 11, 2025
Viaarxiv icon